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ABSTRACT 

In hierarchical models, density fluctuations on different scales are correlated. This 
induces correlations between dark halo masses, their formation histories, and their 
larger-scale environments. In turn, this produces a correlation between galaxy prop- 
erties and environment. This correlation is entirely statistical in nature. We show how 
the observed clustering of galaxies can be used to quantify the importance of this 
statistical correlation relative to other physical effects which may also give rise to cor- 
relations between the properties of galaxies and their surroundings. We also develop 
a halo model description of this environmental dependence of clustering. 

Key words: methods: analytical - galaxies: formation - galaxies: haloes - dark matter 
- large scale structure of the universe 



1 INTRODUCTION 



A number of physical mechanisms are expected to play a role 
in determining the properties of a galaxy: e.g., ram pressure 
stripping, harrassment, and strangulation (Gunn & Gott 
1972; Farouki & Shapiro 1980; Moore et al. 1996; Balogh & 
Morris 2000). Many of these operate in dense environments. 
So the existence of a morphology-density relation — the frac- 
tion of galaxies which have elliptical rather than spiral mor- 
phologies is higher in denser regions (Dressier 1980) — is 
not unexpected. For similar reasons, recent measurements 
of lower star-formation rates in denser regions, which ap- 
pears to persist even at fixed morphology (Balogh et al. 
2002; Gomez et al. 2003) and stellar mass (Kauffmann et 
al. 2004), are not unexpected. However, determining which, 
if any, of the physical mechanisms mentioned above is the 
dominant one is more difficult. 

Hierarchical galaxy formation models have been rather 
successful at reproducing the morphology-density relation 
(e.g. Benson et al. 2001). In these models, a correlation be- 
tween galaxy- type and environment arises even if none of 
the physical mechanisms mentioned above are present. The 
correlation is a consequence of the following assumptions. 
Gravity has transformed small fluctuations in the early Uni- 
verse into the structures we see today. This transformation 
was hierarchical, in the sense that small virialized objects 
formed first, and then merged with one another to form 
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more massive virialized objects at a later time. The virial- 
ized objects present at any given time, called dark matter 
haloes, are approximately 200 times denser than the back- 
ground universe at the time (Gunn & Gott 1972). Galax- 
ies form from gas which cools within virialized dark matter 
haloes (White & Rees 1978). The properties of a galaxy are 
determined entirely by the mass and formation history of 
the dark matter halo within which it formed (e.g. White 
& Frenk 1991; Kauffmann et al. 1997; Somerville & Pri- 
mack 1999; Cole et al. 2000). Halo masses and formation 
histories are directly related to the structure of the initial 
density fluctuation field from which they formed (Press & 
Schechter 1974; Lacey & Cole 1993; Sheth, Mo & Tormen 
2001). In hierarchical models, there is a correlation between 
fluctuations on different scales, and this induces correlations 
between halo mass and/or formation and the larger scale 
environment of a halo (Mo & White 1996; Lemson & Kauff- 
mann 1999; Sheth & Tormen 2002). This, in turn, induces a 
correlation between galaxy-type and environment. This cor- 
relation is entirely statistical in nature. So it is interesting 
to ask if this statistical correlation is sufficient to explain 
most of the observed correlation between galaxy-type and 
environment. 

The main goal of the present work is to show how anal- 
ysis of the clustering of galaxies can be used to quantify the 
strength of this statistical correlation between galaxy prop- 
erties and environment. The idea is to show the strength 
of the statistical effect alone: any discrepancy with observa- 
tions can then be ascribed to the other physical processes 
(Sheth, Abbas & Skibba 2004). Although our argument is 
general, we focus in particular on the two-point correlation 
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function, f (r), and show how it is expected to depend on en- 
vironment if the only environmental effects on galaxy prop- 
erties are those which arise from the statistics of the initial 
fluctuation field. The present analysis is intended to comple- 
ment traditional measures of the environmental dependence 
of clustering which tend to focus on correlations between 
the distribution of an observable (e.g., the luminosity func- 
tion, or the star formation rate, etc.) and the environment 
(see e.g. Yang et al. 2003; Mo et al. 2004; Kauffmann et al. 
2004; Balogh et al. 2004; Blanton et al. 2004, for some recent 
analyses) . 

This paper is arranged as follows. Section 2 uses nu- 
merical simulations to illustrate how clustering is expected 
to depend on environment if the entire environmental depe- 
dence arises from the correlation between haloes and their 
environments. Section 2.1 shows the effect for dark matter. 
A simple toy model which captures most of the relevant fea- 
tures of this effect is presented in Section 2.2. Section 2.3 
uses mock galaxy samples to show that the effect should 
be easily measured in surveys such as the SDSS. The final 
section summarizes our results and discusses some implica- 
tions. An Appendix shows how to describe the effect using 
the language of the halo model (see Cooray & Sheth 2002, 
for a review) of large scale structure. 



2 ENVIRONMENTAL DEPENDENCE OF £ 

Throughout, we show results for a flat ACDM model for 
which (n ,h,a 8 ) = (0.3,0.7,0.9) at z = 0. Here tt is the 
density in units of critical density today, the Hubble con- 
stant today is Ho = lOO/i km s _1 Mpc -1 , and ag describes 
the rms fluctuations of the initial field, evolved to the present 
time using linear theory, when smoothed with a tophat filter 
of radius 8/i _1 Mpc. The GIF and VLS numerical simula- 
tions we use to illustrate our arguments were made avail- 
able to the public by the Virgo consortium. Both were run 
with the same ACDM cosmology, but with slightly different 
parameterizations of the initial fluctuation spectrum. The 
GIF simulation had 256 3 particles in a cubic box with sides 
L = 141/i _1 Mpc. The VLS simulation had 512 3 particles in 
a cubic box with sides L = 479/i~ Mpc. 



2.1 Dark matter simulations 

To illustrate how clustering depends on environment, we 
used the halo and particle distributions in the ACDM GIF 
simulations (Kauffmann et al. 1999). The simulations con- 
tain approximately 90,000 haloes which each have at least 
ten particles, where the virialized halos were found using a 
friends-of- friends group finding program. We defined the en- 
vironment of a halo by using the mass Mr within a sphere of 
radius R centred on the halo. We set R = 5/i -1 or 8ft" 1 Mpc, 
but any value which is substantially larger than the typical 
virial radius (a few hundred kpc), but smaller than the scale 
on which the Universe is essentially homogeneous, will do. 
We then ranked all the haloes in decreasing order of Mr. 
The particles belonging to the top one-third of the haloes 
were labeled as belonging to densest environments, and the 
particles in the bottom one-third of the halo sample were la- 
beled as belonging to the least dense environments. Finally, 
we computed the correlation function of particles belonging 
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Figure 1. Environmental dependence of the dark matter correla- 
tion function in the ACDM GIF simulation. Triangles show §(r) of 
particles in haloes which were denned as having the densest envi- 
ronments (defined by counting the mass within a sphere of radius 
of 5h~ x Mpc centred on each halo), and squares are from particles 
in underdense regions. Smooth curves show the analytic model for 
this environmental effect that is developed in Appendix A. 



to the haloes in the densest and least dense regions. The 
results are shown in Figure 1. 

There are obvious differences between the two correla- 
tion functions. The correlation function for the particles in 
dense regions extends to larger scales; if fit to a power-law, 
it would have a shallower slope. The next section describes 
a simple model for these differences. The smooth curves in 
the figure show the result of a more complete analytic model 
that is developed in Appendix A. 

2.2 A toy model 

Let dn(m,S c )/dm denote the number density of dark mat- 
ter haloes with mass m at a time when the linear the- 
ory overdensity required for spherical collapse is S c , and 
let N(m, S C \M, V) denote the average number of m haloes 
in regions of volume V which contain mass M. Define 
M/V = p(l + 5), where p denotes the average density of 
the background. Dense regions have S > 0. Mo & White 
(1996) showed that a generic feature of hierarchical mod- 
els is that N(m, 5 C \M, V) ^ (1 + 5) V dn/dm: i.e., dark halo 
abundances in dense and underdense regions do not differ by 
a simple factor which accounts for the difference in density. 
Rather, 

N(m, S C \M, V) « [l + b(m)s] V^^, (1) 
d In dn(m, S c ) /dm 



where 



b(m,5 c ) w 1 



dS r 



(2) 



is a function which typically increases monotonically with m 
(e.g. Sheth & Tormen 1999). As a result, one expects the ra- 
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tio of the number of massive to low mass haloes to be larger 
in dense regions than in less dense regions: the mass function 
in overdense regions should be 'top-heavy'. Measurements in 
simulations indicate that this is indeed the case (e.g. Lem- 
son & Kauffmann 1999; Sheth & Tormen 2002): the average 
halo mass increases with 8. 

In the GIF simulations, the average mass of the ~ 
30, 000 haloes which reside in the densest regions is approx- 
imately m = 2.5 x lO 12 fe _1 M0, whereas the average mass 
of the ~ 30, 000 haloes which reside in the least dense re- 
gions is m — 5.2x 10 11 hT x Mq. (Hence, the two overdensities 
differed by a factor of approximately 5.) 

The fact that dense regions have a top-heavy mass func- 
tion has an important consequence for the environmental de- 
pendence of the correlation function which we now describe. 
Let £(r|<$) denote the shape of the correlation function in 
regions with overdensity 8. Although such regions contain 
haloes with a range of masses, suppose we require that all 
haloes have the same mass ms, chosen to match, say, the 
mean halo mass in the regions. If these haloes are not clus- 
tered, then £(r|(5) is simply a consequence of the shape of 
the density profiles p(r\8) around haloes (Peebles 1974): 

ar\S) = n clus {m s ) [ ds p(s + r\ms,S) ^ 

J Ps ps 

where n c i us (ms) is the average number density of haloes 
surrounded by regions with overdensity 8, ps = msn c i us , 
and ms = f ds p(s\ms,S). In fact, the haloes are clustered, 
but Sheth & Jain (1997) show why ignoring halo clustering 
should be a good approximation on small scales. Hence, to 
estimate the small scale correlations as a function of environ- 
ment, we require an estimate of the shapes of halo density 
profiles. 

When fit to spherical models, the density profiles 
of haloes in simulations are well fit by the functional 
form p(r\m)/p = A c (r vir /cr)/(l + cr/r vir ) 2 where r vir = 
(3m/47rA t , lr p) 1/3 and c = 9 (10 13 /i _1 Mo/m) 01 (Navarro et 
al. 1997). If we assume that the density profiles of m haloes 
are the same in all environments (we will modify this as- 
sumption shortly), then £(r|<5) is given by inserting this ex- 
pression for the density profile into the convolution integral 
above. The result is 

£(r\5) = ^ Vlr _ \(r\m s ,8), provided r < 2rw, (4) 
Ps/P 

where A is a messy function of c and cr/r V i r (given in Sheth 
et al. 2001). Since A vir is independent of m, in such a model, 
the environmental dependence of £(r|<5) comes entirely from 
the fact that dense regions host the more massive haloes, and 
halo density profiles depend on halo mass. The factor of ps/p 
in the denominator derives from the fact that haloes which 
have a fixed overdensity relative to the global background 
density p have a different overdensity relative to the local 
background ps- 

Figure 2 shows the result of this simple analytic esti- 
mate of £(r|<5) for the two sets of GIF simulation particles: 
those which reside in the 30,000 haloes with the largest sur- 
rounding overdensities (as described previously), and those 
which reside in the 30,000 haloes with the smallest surround- 
ing overdensities The curves are qualitatively similar to the 
measurements shown in Figure 1, at least out to scales on 
which the measurements show an inflection: £ falls to zero 
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Figure 2. The correlation function of particles in haloes which 
are surrounded by dense regions (curves which extend to larger r), 
and by less dense regions (curves which fall to zero at smaller r), in 
our toy model. The number of haloes in the two environments was 
assumed to be the same, but the halo mass in dense regions was 
assumed to be larger by a factor of 5. Solid curves have c = 10 in 
both cases, whereas dashed curves include a realistic prescription 
for the weak mass dependence of c. 

on smaller scales in the less dense regions. The inflections at 
~ 0.8/i~ 1 Mpc and ~ 3/i _1 Mpc in the simulations (Figure 1) 
denote the scales which are approximately twice the virial 
radii of the typical haloes in the two regions. Beyond this 
scale, halo-halo correlations become important; we build a 
model for this in Appendix A. 

The curves which extend to larger scales are those for 
particles in the denser regions. This is easily understood, 
because dense regions are those for which ms is larger, and 
hence r„; r is larger. The reason why £(r|<5) on small scales 
is larger for the less dense regions is more subtle. 

The solid curves show results in which we have set 
c = 10 and ignored the mass dependence of c, and dashed 
curves include the mass dependence but assume that there 
is no additional dependence on environment. Clearly, the 
mass dependence of c is not a dominant effect even on scales 
smaller than r vir /c. Thus, the difference in amplitudes on 
small scales derives from the factor of ps/p in the expres- 
sion above, and not from the mass dependence of c. In the 
present example, the number of haloes in the two (dense 
and underdense) regions is the same, but ms in dense re- 
gions is larger, so ps/p is larger for the dense regions. Since 
the shape of £ is approximately constant on small scales, it 
has a lower amplitude in denser regions. 

The relative unimportance of c has the following in- 
teresting consequence. Suppose that halo density profiles do 
depend on environment (numerical simulations are only just 
reaching the resolution required to address this question) . A 
simple way of parameterizing this dependence is to allow c 
to depend both on m and 8. If c depends only weakly on 8, 
then the effect on £ will only be noticeable on scales smaller 
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than r V i r /c, where c denotes the mean c (averaged over en- 
vironments) . 

On the smaller scales, where halo-halo correlations are 
not important, the differences between the two curves in 
Figure 1 are qualitatively like those of the simple toy model 
described above, indicating that our use of a mean density- 
dependent halo mass mj, rather than a distribution of 
masses, does capture the essential features of the density 
dependence of £(r|<5). On larger scales, where halo correla- 
tions are important, Figure 1 shows that £(r|(5) is stronger in 
dense regions. This is not unexpected in the context of the 
linear peaks-bias model of (Kaiser 1984), if, on average, the 
densest regions at the present time formed from the densest 
regions in the initial fluctuation field. This is because, in the 
initial Gaussian random field, the densest regions were more 
strongly clustered than regions of average density. Therefore, 
our simple model, in which the environmental dependence 
of £ is entirely due to the environmental dependence of the 
halo mass function, suggests the following generic features 
for £(r|<5): on scales larger than the virial radius of a typi- 
cal halo, the amplitude of £ should increase as 8 increases; 
on smaller scales, the amplitude of £ in underdense regions 
should be larger; hence, the slope of £ in dense regions should 
be shallower than in less dense regions. 

2.3 Mock galaxy samples 

To illustrate that the features described above really are 
generic, and to make a closer connection to observations, we 
assigned mock 'galaxies' to haloes in the simulations as fol- 
lows. Sufficiently low mass haloes contain no galaxies. Haloes 
more massive than some rtiL contain at least one galaxy. 
The first galaxy in a halo is called the 'central' galaxy. The 
number of other 'satellite' galaxies is drawn from a Poisson 
distribution with mean N s (m) where 



N s (m) 



m 
mi 



if m ^ tul- 



(•>) 



This procedure is motivated by Kravtsov et al. (2004). We 
distribute the satellite galaxies in a halo around the halo cen- 
tre so that the radial profile follows that of the dark matter 
(i.e., the galaxies are assumed to follow an NFW profile). We 
set m L = 10 11 - 27 h~ 1 M e , mi = 23m L , and a = 0.92; Zehavi 
et al. (2004) show that this choice is appropriate for SDSS 
galaxies more luminous than M r < —18. We then compute 
N5 , the number of galaxies in a 5ft" 1 Mpc sphere around each 
galaxy, and rank the galaxies in order of decreasing N5 . The 
top one-third are labeled as being galaxies in dense regions, 
and the bottom third as being in underdense regions. 

Figure 3 presents the correlation functions of the mock 
galaxies classified as being in dense (top) and underdense 
(bottom) regions. Once again we see the generic tendency for 
£ computed using objects in denser regions to be shallower 
than when objects in sparser environments are used. 

The important point, which we note explicitly here, is 
the following: By assuming that equation (5) is the same 
function of m for all environments, and by assuming that the 
radial profile of the galaxies depends only on halo mass and 
not on environment, we have constructed a galaxy catalog in 
which all environmental effects are entirely a consequence of 
the correlation between halo mass and environment. There- 
fore, the locii traced out by the two sets of symbols shown 
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Figure 3. Similar to Figure 1, but now for model galaxies dis- 
tributed according to the model described in the text around 
equation 5, and density defined by counting galaxies in spheres 
centred on the galaxies themselves, rather than the mass in 
spheres centred on haloes. Curves show the analytic model de- 
veloped in Appendix A. 



in Figure 3 represent the predicted environmental depen- 
dence of £ if there are no environmental effects other than 
the statistical one determined by the initial fluctuation field. 

Over the l-10/i _1 Mpc range which the SDSS data cur- 
rently probe most reliably, £ for the dense and underdense 
samples differs by an order of magnitude. This difference is 
easily measurable in data sets which are currently available. 
Comparison of this predicted difference with measurements 
in the SDSS will provide a sharp test of the assumption that 
environmental effects are dominated by the statistical cor- 
relation between the halo mass function and environment, 
rather than by other 'gastro'physics. 



3 DISCUSSION AND CONCLUSIONS 

In hierarchical models, the clustering of dark matter par- 
ticles should be a strong function of environment. This is 
a consequence of the fact that massive haloes populate the 
densest regions. If the properties of galaxies are determined 
entirely by the masses and formation histories of the haloes 
in which they sit, then the clustering of galaxies should also 
depend strongly on environment. 

We discussed a method for testing this assumption. The 
test is particularly interesting given recent work which sug- 
gests that, at fixed mass, haloes in dense regions form at 
higher redshifts (Sheth & Tormen 2004), and that this ef- 
fect is a decreasing function of halo mass (Gao, Springel & 
White 2005). In particular, Section 2.3 described how to gen- 
erate a mock galaxy catalog in which all correlations with 
environment are a consequence of the correlation between 
halo abundances and environment. Comparison of the envi- 
ronmental dependence of clustering in the mock catalog and 
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in real data allows one to test the assumption that environ- 
mental effects are dominated by the statistical correlation 
between halo mass and environment. 

The Appendix shows how to incorporate the assump- 
tion that environmental effects are determined entirely by 
the statistical correlation between halo mass and environ- 
ment into the halo model description of large scale structure. 
While this model aids in understanding the effect, to per- 
form the test with data, it is unnecessary — measurements 
of clustering in the mock catalogs of the sort described in 
Section 2.3 are sufficient. However, the methodology de- 
veloped in this work provides a means for computing the 
clustering properties of galaxies affected by other environ- 
ment dependent processes (assuming that some model for 
{N g ai\m, M, V) could be derived) that explicitly do not de- 
pend upon the surrounding halo mass. For instance, winds 
from galaxies in nearby clusters are assumed to depend on 
the larger scale environment and could affect the clustering 
statistics. 

In particular, on scales larger than a Megaparsec or so, 
the two-point correlation function £ of galaxies surrounded 
by high density regions is predicted to be larger than for 
galaxies in less dense regions. In addition, the slope of £ 
for galaxies in dense regions should be shallower (cf. Fig- 
ure 3). And if the distribution of galaxies around halo cen- 
ters depends only weakly on environment (current analyses 
of galaxy clustering assume any such dependence is negligi- 
ble), the effect on their clustering amplitude is small. 

Our test using mock catalogs is unrealistic in one re- 
spect: with real data, one only has redshift-space positions, 
so one cannot define the local density in real-space. How- 
ever, if the scale on which the redshift-space density is de- 
fined is larger than the typical size of redshift-space distor- 
tions (~ 5/i _1 Mpc), then the real and redshift-space den- 
sities will not differ substantially. Hence, we do not expect 
our separation into environments based on the real-space 
density within 8/i _1 Mpc spheres to differ substantially from 
that which we would have obtained had we used redshift- 
space positions instead. A more detailed description of the 
redshift space method is the subject of work in progress. 
(E.g., what does one gain by measuring both £(s) and the 
projected correlation function as a function of redshift space 
environment? The latter quantity depends on local density 
for similar reasons that £(r) does — the halo mass function 
depends on environment — whereas £(s) will have an addi- 
tional effect coming from the fact that redshift space distor- 
tions depend on halo mass.) Comparison of these predictions 
with clustering in the Sloan Digital Sky Survey is underway. 
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APPENDIX A: AN ANALYTIC MODEL 

This Appendix provides an analytic model which incorpo- 
rates the assumption that environmental effects are deter- 
mined entirely by the statistical correlation between halo 
mass and environment. Note that to perform the test with 
data, this analytic model is unnecessary — measurements of 
clustering in the mock catalogs described previously are suf- 
ficient. 

The analysis which follows uses the framework of the 
halo model of large scale structure (see Cooray & Sheth 
2002, for a review). It extends that of the toy model de- 
scribed in the main text in two ways: it allows for a range of 
halo masses, and it allows for correlations between haloes. 



where W(x) — (3/:r 3 )[sin(:r) — a;cos(a;)] and Ra = 
(3M/4np) 1/B . That is to say, a(M) is the rms value of the 
initial fluctuation field when it is smoothed with a tophat 
filter of comoving size Ro, extrapolated using linear theory 
to the present time. Here S sc is the critical density required 
for spherical collapse, extrapolated to the present time us- 
ing linear theory (it is 1.686 for an Einstein de-Sitter model), 
and a « 0.71, p = 0.3 and A = (1 + T(l/2 -p)/^/2 v y 1 ^ 
0.322. If a = 1, p = and A = 1/2, then dn/dm is the same 
as the universal mass function first written down by Press 
& Schechter (1974). 



Al The halo model for dark matter 

In this model, all mass is bound up in dark matter haloes 
which have a range of masses. Hence, the background density 
is 



/ 



dm 



dn(m) 
dm 



(Al) 



where dn(m)/dm denotes the number density of haloes of 
mass m. The correlation function is the Fourier transform 
of the power spectrum P(k): 
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dk k 3 P(k) sin kr 
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In the halo model, P(k) is written as the sum of two terms: 
one that arises from particles within the same halo and dom- 
inates on small scales (the 1-halo term), and the other from 
particles in different haloes which dominates on larger scales 
(the 2-halo term). Namely, 



P(k) = P lh (k) + P 2h (k), 



where 



.(*)=/ 



dm 
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dm p 
dn(m) m 



\u(k\m)\ 2 



dm 



— b(m) u(k\m) 
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(A4) 



P Li „(fc). 



Here u(k\m) is the Fourier transform of the halo density 
profile, b(m) is the bias factor which describes the strength 
of halo clustering, and Pun{k) is the power spectrum of 
the mass in linear theory. When explicit calculations are 
made, we assume that the density profiles of haloes have 
the form described by Navarro et al. (1997), and that halo 
abundances and clustering are described by the parameter- 
ization of Sheth & Tormen (1999): 
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p dm 
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A2 Including the environmental effect 

The expressions above are the result of averaging over en- 
vironments. Including the environmental dependence of the 
halo distribution explicitly is not entirely straightforward, 
because, as we describe below, it requires knowledge of the 
probability that a region of volume V has overdensity S. As 
we describe below, we use the excursion set model described 
in Sheth (1998) to do this. 

In this model, spherical evolution is described by a 
'moving barrier': 
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where A = M/pV. This barrier is said to be 'moving' be- 
cause, for general V, it is a function of M. The excursion 
set model attributes special significance to the first crossing 
distribution f(M, V)dM of this barrier by Brownian motion 
random walks: it is a measure of the mass fraction in regions 
of size V which contain mass M. In this approach, a halo 
can be thought of as a patch which has collapsed to van- 
ishingly small volume. In the limit of V — » 0, So — > S BC is 
the same constant for all M. In this limit the barrier is said 
to have constant 'height', and the first crossing distribution 
f(M,5 BC )dM represents the mass fraction in haloes of mass 
M. Thus, f(M,S sc ) equals (M/p) times dn(M, S sc )/dM, the 
halo mass function. 

Notice that, for general V, <5o(A) ^ <5 SC for all M. Hence, 
/(m, 5 ac \M, V), the first crossing distribution of S sc by walks 
which first crossed the moving barrier associated with non- 
zero V at M, denotes the mass fraction of cells of size V 
which contain mass M which is in haloes which have mass 
m for some m ^ M. This fraction equals (m/M) times 
N(m, 6ac\M, V), the environmental dependent mass function 
discussed in the main text (cf. equation 1). 

In the main text (e.g. Section 2.1), we define the en- 
vironment of a halo by considering the mass in a patch of 
volume V surrounding it. To classify haloes by their en- 
vironment, we must be able to estimate the number den- 
sity of haloes of mass m which are surrounded by regions 
of volume V which contain mass M. If dn(m)/dm denotes 
the number density of m haloes, and f(M,V\m) denotes 
the fraction of m haloes which contain mass M in the sur- 
rounding volume V, then the number density of such haloes 
is f(M,V\m)dn(m)/dm. In the excursion set model, this 



The environmental dependence of clustering in hierarchical models 7 



equals 

f( M , V\m)dM = f(m)f(M, V\m) dM 
dm m 

= £.f(M,V)f(m\M,V)dM ( A7 ) 

= n(M,V)N(m\M, V) dM, 

where f(M, V)dM is given by computing the first-crossing 
distribution of the moving barrier associated with spherical 
collapse described above. (In practice, we use the analytic 
approximation to the first crossing distribution of such mov- 
ing barrier problems given by Sheth & Tormen 2002). 

The mass density contributed by haloes that are em- 
bedded in regions of mass M min < M < M max is; 



dm^™> f(M,V\m)m 
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In the standard model, the density profile of a halo de- 
pends on its mass, but not on the surrounding environment. 
In this case, the one-halo term is 
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This reduces to the standard 1-halo term in the limits 
Mmin — * and M max — * oo. 

The two-halo term is more complex as it now has two 
types of contributions: pairs which are in the same patch 
(2-halo-l-patch), and pairs in different patches (2-halo-2- 
patch). The 2-halo-l-patch term can only be important on 
intermediate scales (i.e., those which are larger than the di- 
ameter of a typical halo but smaller than the diameter of a 
patch). It is more complex to model this term accurately, as 
we describe shortly. The 2-halo-2-patch term, on the other 
hand, is simpler. It should be well approximated by 
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where PLi n (k\R p ) denotes the power spectrum associated 
with setting the linear theory correlation function to —1 on 
scales smaller than the diameter of a patch 2R P . This trunca- 
tion has little effect on small kR p <C 1, where Pun_{k\R p ) « 
Phin(k). The factor B(M,V) describes bias associated the 
clustering of the patches, and depends on the abundance 
of such patches in exactly the same way that b(m) is re- 
lated to dn(m)/dm (c.f. equation 2). Note that this expres- 
sion assumes that the 2-halo-2-patch term is given simply 
by weighting the patch-patch correlation by the halo abun- 
dance within a patch. 

To a first approximation, patches do not overlap with 
one another, so the 2-halo-2-patch term should drop on 
scales smaller than the diameter of a patch. It is on these 



scales that P2h~i P should begin to dominate. A first approx- 
imation for the net effect of P 2 h-i P: then, is to not enforce 
this small-scale decrease of P2h-2 P , and to simply use the ex- 
pression above for P2h~2 P but Pu n (k) instead of Phi n {k\R p ), 
for all k. Section A4 shows that this expression reduces to the 



standard two-halo term when M„ 



and M„ 



oo, 



so it is at least a reasonable approximation. However, we will 
see below that the correlation function of objects in under- 
dense regions sometimes shows a clear signature of the fact 
that patches do not overlap. Thus, a more sophisticated ap- 
proximation is required to describe clustering in underdense 
regions. 

To estimate the 2-halo-l-patch term, it is convenient to 
think of the patches as haloes, and of the haloes as patch- 
substructure. Sheth & Jain (2003) have developed the halo 
model description of clustering when haloes have substruc- 
ture. They allow for the possibility that a halo may be made 
up of a smooth component plus a population of subclumps. 
Our present case corresponds to assigning all the mass to 
subclumps, and none to the smooth component, so that only 
the final two of the four terms in their equation (31) con- 
tribute. Our expression for P 1 f l (k\S) is effectively the same 
as their final term, so their third term is our P2h-i P (k\S). 
Namely, 



P2h-i P (k\S) = / dMn(M,V) 

J M mi „ 

x / dmi ( \u(k\mi)\ 
Jo \Ps J 

( M - m A (m 2 \. ... .. 

x / drri2 —r- \u(k\m2)\ 
Jo \Ps J 



(AH) 



x N(mi,m 2 \M,V) f/(fc|mi, m 2 , M) 2 



Here U denotes the normalized Fourier transform of the spa- 
tial distribution of mi and rri2 haloes within a patch. A 
simple first estimate would use the correlation function of 
the haloes to model this profile, but to truncate this at the 
patch radius, since both haloes are required to lie within the 
patch: 



U w b(m 1 )b(m2) 



Pun(k) - P Un (k\R p ) 
M/p 



(A12) 



where PLin(fc) — PLin(k\R p ) denotes the power spectrum as- 
sociated with setting the linear theory correlation function 
to zero on scales larger than the diameter of a patch 2R P . 
The other term, N(mi, m 2 |M, V), denotes the average num- 
ber of haloes of mass mi and m 2 in patches V which contain 
total mass M. Sheth & Lemson (1999) argue that, as a con- 
sequence of mass conservation, 



N(m 1 ,m 2 \M, V) w N(mi\M, V) N(m 2 \M - mi, V - v mi ), 

(A13) 

where v m is the volume associated with an m halo. 

When M m and V 3> v m , as it is in large overdense 
regions, then N(mi, m 2 |M, V) w N(mi\M,V) N(m 2 \M, V) 
i.e, N(mi, m 2 \M, V) is well approximated by the product of 
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the individual mean values. In this case, 
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In underdense regions, however, simply using the product of 
the individual mean values is expected to be a bad approx- 
imation. 

The smooth curves in Figure 1 show that the model 
developed above provides a good description of the envi- 
ronmental dependence of the dark matter correlation func- 
tion. In our comparisons, we have found that the full ex- 
pression (equations A11-A13) provides a substantially bet- 
ter description of Pih-\p in the underdense regions, whereas 
the simpler approximation of equation (A14) is adequate for 
the dense regions. 

A3 From dark matter to galaxies 

We now discuss how the model above can be extended to 
describe the environmental dependence of galaxy cluster- 
ing. When all environments have been averaged over, the 
mean number density of galaxies n ga i is given by replac- 
ing the weighting by m in equation (Al) for the mean 
density by {N ga i\m}, the mean number of galaxies in an 
m-halo. For instance, one could use equation (5) to write 
(Ngai\m) — 1 + (N s \m). Similarly, the weighting by m/p 
in P2h(k) is replaced with a weighting by (N ga i\m) /n ga i. 
And the weighting by (m/p) 2 in the 1-halo term becomes 
\2(N s \m) u{k\m) + (N s \m) 2 \u(k\m)\ 2 ]/n 2 gal . This weighting 
assumes there is always one galaxy at the centre, and that 
the number of satellite galaxies in an m-halo follows a Pois- 
son distribution with mean (N s \m). 

If one had an estimate of the dark matter density field, 
then one could include the environmental dependence of 
galaxy clustering by writing the mean number of galaxies in 
an m-halo surrounded by a region V containing mass M as 
(N ga i\m, M, V). If this mean number did not depend on M 
and V, then the environmental dependence of galaxy clus- 
tering would be described by making the same replacements 
in the expressions for P(k\S) as one makes for P(k). 

In practice, one observes galaxies, not dark matter, 
so one has an estimate of the galaxy density field 8 ga i = 
N/n g aiV — 1, and not of the dark matter S — M/pV — 1. 
Our previous expressions show what to do if the environ- 
ment as defined by the mass density S is known; describing 
galaxy clustering as a function of environment defined by 
the galaxies themselves, S ga i rather than by the dark matter 
5, is considerably more complicated. 

A 3.1 Effects of scatter in the S ga i — 5 relation 

However, a simple approximate model can be derived if one 
assumes that (N ga i\m, M, V) = (N ga i\m) is a monotonic 
function of m, and that the scatter around this mean rela- 
tion is small. The reason why is particularly easy to see if 



(N ga i\m) oc m and there is no scatter around this relation. 
If the environment of each galaxy is quantified by the num- 
ber N of other galaxies in a volume V around it, then rank 
ordering cells by N, which is an observable, is the same as 
rank ordering by M, which is not. This rank ordering allows 
us to describe the environmental dependence of galaxy clus- 
tering by making simple adjustments to the expressions of 
Section A2. 

In particular, suppose that one has measured how the 
number density of galaxies surrounded by regions containing 
at least N m i n other galaxies depends on N min . This number 
density is given by summing over the observable quantity 
N. However, this number density can also be written as 
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where M m i n (N m i n ) is obtained by requiring that the value 
of this expression match that observed as N min is varied. 
Once M m i n (Nmin) is known, the two-halo two-patch term 
can be written as 



jgal 
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x / dmN(m\M,V)(N gal \m), 
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with the analagous substitutions for the (m/p$) terms in 
equation (All) for P2h-ip{k\8). For similar reasons, the one- 
halo term in the centre plus Poisson satellites model is 

p lh\ k \ 5 )= dMn{M,V) dmN(m\M,V) 

J M min J() 

[2(N s \m) u{k\m) + (N s \m) 2 Hfc|m)| 2 ] 



n S-gal 



(A17) 



These expressions, which follow from those in Section A2, 
are only accurate if the relation between the number of 
galaxies in a cell (the observable) and the mass in the 
cell M is deterministic (i.e., has no scatter) and monotonic. 
There will be scatter in iV at fixed M if (iV S£l ;|m} is a mono- 
tonic but nonlinear function of m, even if there is no scatter 
around the (N ga i\m) relation. This scatter arises from the 
fact that there is scatter in the halo distribution at fixed 

M. (Write N = J2i N ( m i) and = M - This shows 

that N is independent of the distribution of the mj only if 
N(mi) oc mi.) 

Scatter in JV at fixed M means there is scatter in M at 
fixed N. The expressions above should provide reasonable 
approximations to the exact description if the mean mass 
M associated with a given value of N in a cell, (M\N), is 
a monotonic function with small scatter. In this context, 
'small' means that the dependence of clustering on envi- 
ronment (e.g., the mix of haloes) does not change dramat- 
ically over the range of environments associated with the 
rms scatter in M around the mean (M\N). To see what this 
means, recall that, if the scale on which the environment is 
defined is large, then equation (1) indicates that this depen- 
dence is proportional to b(m)(M/pV — 1). Massive haloes 
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Figure Al. Number of galaxies within spheres of radii 5h~ 1 Mpc 
centered on each galaxy, versus the total mass in the sphere, for 
the galaxy model described in Section 2.3. The small scatter in M 
at large TV suggests that our model should be reasonably accurate. 



have b(m) > 1, with b(m) a strongly increasing function 
of m, whereas low mass haloes have b(m) < 1, and the m- 
dependence is weak. Since the most massive haloes populate 
the densest cells, the model can tolerate larger scatter in the 
M — N relation at small N than at large N. 

Figure Al shows the relation between the number of 
galaxies N and the mass M within a sphere of radius 
5h~ Mpc centred on each galaxy, for the galaxy model de- 
scribed in Section 2.3. The figure indicates that treating N 
as a monotonic deterministic function of M (and vice- versa) 
is a good approximation at least at high masses, even though 
the underlying relation between number of galaxies and halo 
mass, {Ng a i\m), is nonlinear (equation 5). Notice that the 
scatter around the mean (M\N) relation is particularly small 
at large N. Although the scatter increases at smaller N, we 
argued above that, in this regime, the effect of scatter is less 
important. Hence, Figure Al suggests that the simple model 
developed in this section should be reasonably accurate. 

Recall that Figure 3 shows measurements of the envi- 
ronmental dependence of galaxy clustering made in a mock 
galaxy catalog constructed using the GIF simulation follow- 
ing methods outlined in Section 2.3. The smooth curves in 
Figure 3 show that the model developed here is in reasonable 
agreement with the measured dependence of £(r) on 5 ga i- 

A3. 2 The two contributions to the 2-halo term 

In constructing the model, we remarked that there were two 
types of contribution to the two-halo term. So one might 
wonder if there is some clear signature of the transition from 
one type of contribution to another. The mock galaxy sam- 
ple used for Figure 3 does not result in a with a clear 
inflection or break on the patch scale. 

However, such a transition is seen in Figure A2, which 
shows results for a mock galaxy sample in the VLS simu- 
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Figure A2. Correlation function for mock galaxies in the VLS 
simulations. Squares show §(r|(5) for the overdense sample and tri- 
angles are for the underdense sample. In the top panel, the density 
was defined by counting galaxies within spheres having radius 
5h _1 Mpc centred on each object, and the bottom panel used 
8h~ Mpc spheres. Curves show the associated analytic model 
which is able to describe the three different clustering regimes. 



lations, constructed with M m %n — 10 ' h~ Mq and a = 
1.39, chosen to be similar to galaxies more luminous than 
M r = —21. The top and bottom panels show results from 
analyses in which the large scale environment was defined 
by the mass within spheres of radii 5/i -1 and 8fa -1 Mpc, 
respectively. The inflection indicating the transition from 
£,2h-ip(r\8) to £,2h-2p(r\8) is more clearly seen for low density 
regions, and it is clearer when the scale which defines the 
environment is smaller. For these more luminous galaxies, 
using the simpler equation (A14) in P2h-i P is an excellent 
approximation in dense regions, but grossly overestimates 
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(by at least a factor of two) the clustering strength on inter- 
mediate scales in underdense regions. In underdense regions, 
equations (A11)-(A13) are substantially more accurate. We 
would like to point out that the relation between N and M 
(that was mentioned in the previos section) within spheres 
of 8/i -1 Mpc, shows more scatter at large N. As the model 
is more sensitive to larger scatter at large N, we believe this 
causes the analytical prediction to depart from the simula- 
tion results at small scales, as shown in the lower panel of 
Figure A2. 

Figure 1 indicates that our model for the environmen- 
tal dependence of dark matter clustering is accurate, and 
the agreement between the symbols and the curves in Fig- 
ures 3 and A2 indicates that our approximate model for the 
environmental dependence of galaxy clustering also works 
quite well. Although it is reassuring that the model works 
so well — it suggests that we understand the physics which 
drives the environmental dependence of clustering — this an- 
alytic description is not needed to perform the test of envi- 
ronmental effects described in the main text. 



A4 Consistency checks 

This section shows that the expressions in Section A2 do 
reduce to the standard expressions in Section Al upon av- 
eraging over all environments. 
The mass density is 
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when M m i n — ► and M max — > oo (this limit corresponds to 
averaging over all environments). In this limit, the one-halo 
term is 

{• oo 

Pih(k\S) / dMn(M,V) 

Jo 

i-U / \ 2 

x J dmN(m\M,V)y—j \u{k\m)\ 2 



■Jo 



dm ( — ) \u(k\m)\ 



x / dMn{M, V)N{m\M, V) 

J rr 



Similarly, the two halo term becomes 
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This can be simplified as follows. The halo overdensity is 

5 h (m\M,V) = dNMM,V)/dm ^ ^ & 
Vdn(m)/dm 

When V » 1, then S <§C 1, and M is almost surely much 
larger than the typical halo mass, so M » m for most values 
of m. In this limit, 

dN(m,S c \M, V)/dm _ ^ ^ dn[m, 8 C — 8 (5)}/dm 
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where we have used the fact that Jo rj j C 1. Hence, 
we can approximate 8h(m) w b(m)8 where b(m) ~ 1 - 
d\nn(m,Sc)/d8 c (equation 2) is no longer a function of V. 
Our assumption is that B(M) is related to n(M,V) simi- 
larly. Thus, the second integral in the expression above be- 
comes 
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If we do not truncate the two-halo term on small scales, 
as a crude approximation for p2h-i P , i.e., if we simply set 
PLin{k\R p ) — » PLin(k), then this agrees with equation (A4). 

If we do truncate, then we must check if the large-scale 
limit of P2h-i P + P2h-2 P agrees with equation (A4). In the 
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limit of large patches and all environments, the two-halo 
one-patch term becomes 
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On large scales 8 <JC 1 so we can ignore bS compared to unity. 
Since M/p = V{l + 5), 
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where the second line follows from the fact that if M is much 
larger than the mass of the largest halo, then the upper limit 
of the integrals over m can safely be changed from M to oo, 
and the final line uses the fact that the integral over M 
which remains is the same as the integral over the counts- 
in-cells probability distribution, and so equals unity. This 
shows explicitly that, in this limit, P2h-i P + Pih-2 P agrees 
with equation (A4). 



